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F v antibody construct comprising at least four 
variable domains which arc connected to one 
another via peptide linkers 1, 2 and 3. The 
invention also relates to expression pi asm ids 
which code for such an F v antibody construct. 
In addition, the invention relates to a method 
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to the use thereof. 



(57) 



Die vorliegende Erfindung betrifft 
ein multivalentes Fv-Antik6rper-Konstrukt 
mit mindestens vier variablcn Domanen, 
die Qber die Peptidl inker I, 2 und 3 
miteinander verbunden sind. Fcmcr betrifft 
die Erfindung Expressionsplasmide, die 
fur ein solches Fv-AntikCrper-Konstrukt 
codieren, und ein Verfahren zur Hersteliung 
der Fy-Antikorper-Konstnikte sowie deren 
Vcrwendung. 



Promoter Vh-A V l -B V h -B V l -A HiSe 

Epttop t 
EPITOPE a 



Leader 



Linker 1 



Unker3 



linker 2 




CA 02331641 2000-11-03 



Applicant: Deutsches Krebsf orschungszentrum 
Attorney's File: K 2675 



Multivalent Antibody Constructs 

The present invention relates to multivalent F v antibody 
constructs, expression plasmids which code for them, and a 
method for producing the F v antibody constructs as well as 
the use thereof. 

Natural antibodies are dimers and are therefore referred to 
as bivalent. They have four variable domains, namely two V H 
domains and two V L domains. The variable domains serve as 
binding sites for an antigen, a binding site being formed 
from a V H domain and a V L domain. Natural antibodies 
recognize one antigen each, so that they are also referred 
to as monospecific. Furthermore, they also have constant 
domains which add to the stability of the natural 
antibodies. On the other hand, they are also co-responsible 
for undesired immune responses which result when natural 
antibodies of various animal species are administered 
mutually . 

In order to avoid such immune responses, antibodies are 
constructed which lack the constant domains. In particular, 
these are antibodies which only comprise the variable 
domains. Such antibodies are designated F v antibody 
constructs. They are often available in the form of single- 
chain monomers paired with one another. 
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However, it showed that F v antibody constructs only have 
little stability. Therefore, their usability for therapeutic 
purposes is strongly limited. 

Thus, it is the object of the present invention to provide 
an antibody by means of which undesired immune responses can 
be avoided. Furthermore, it shall have a stability which 
makes it usable for therapeutic uses. 

According to the invention this is achieved by the subject 
matters defined in the claims. 

Therefore, the subject matter of the present invention 
relates to a multivalent F v antibody construct which has 
great stability. Such a construct is suitable for diagnostic 
and therapeutic purposes. 

The present invention is based on the applicant's insights 
that the stability of an F v antibody construct can be 
increased if it is present in the form of a single-chain 
dimer where the four variable domains are linked with one 
another via three peptide linkers. The applicant also 
recognized that the F v antibody construct folds with itself 
when the middle peptide linker has a length of about 10 to 
30 amino acids. The applicant also recognized that the F v 
antibody construct folds with other F v antibody constructs 
when the middle peptide linker has a length of about up to 
10 amino acids so as to obtain a multimeric, i.e. 
multivalent, F v antibody construct. The applicant also 
realized that the F v antibody construct can be multi- 
specific. 

According to the invention the applicant's insights are 
utilized to provide a multi-valent F v antibody construct 
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which comprises at least four variable domains which are 
linked with one another via peptide linkers 1, 2 and 3. 

The expression "F v antibody construct" refers to an antibody 
which has variable domains but no constant domains. 

The expression "multivalent F v antibody construct" refers to 
an F v antibody which has several, but at least four, 
variable domains. This is achieved when the single-chain F v 
antibody construct folds with itself so as to give four 
variable domains, or folds with other single-chain F v 
antibody constructs. In the latter case, an F v antibody 
construct is given which has 8, 12, 16, etc., variable 
domains. It is favorable for the F v antibody construct to 
have four or eight variable domains, i.e. it is bivalent or 
tetravalent (cf. Fig. 1). Furthermore, the variable domains 
may be equal or differ from one another, so that the 
antibody construct recognizes one or several antigens. The 
antibody construct preferably recognizes one or two 
antigens, i.e. it is monospecific and bispecific, 
respectively. Examples of such antigens are proteins CD19 
and CD3. 

The expression "peptide linkers 1, 3" refers to a peptide 
linker adapted to link variable domains of an F v antibody 
construct with one another. The peptide linker may contain 
any amino acids, the amino acids glycine (G) , serine (S) and 
proline (P) being preferred. The peptide linkers 1 and 3 may 
be equal or differ from each other. Furthermore, the peptide 
linker may have a length of about 0 to 10 amino acids. In 
the former case, the peptide linker is only a peptide bond 
from the COOH residue of one of the variable domains and the 
NH 2 residue of another of the variable domains. The peptide 
linker preferably comprises the amino acid sequence GG. 
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The expression "peptide linker 2" refers to a peptide linker 
adapted to link variable domains of an F v antibody construct 
with one another. The peptide linker may contain any amino 
acids, the amino acids glycine (G) , serine (S) and proline 
(P) being preferred. The peptide linker may also have a 
length of about 3 to 10 amino acids, in partiuclar 5 amino 
acids, and most particularly the amino acid sequence GGPGS, 
which serves for achieving that the single-chain F v antibody 
construct folds with other single-chain F v antibody 
constructs. The peptide linker can also have a length of 
about 11 to 20 amino acids, in particular 15 to 20 amino 
acids, and most particularly the amino acid sequence (G 4 S) 4 r 
which serves for achieving that the single-chain F v antibody 
construct folds with itself. 

An F v antibody construct according to the invention can be 
produced by common methods. A method is favorable in which 
DNAs coding for the peptide linkers 1, 2 and 3 are ligated 
with DNAs coding for the four variable domains of an F v 
antibody construct such that the peptide linkers link the 
variable domains with one another and the resulting DNA 
molecule is expressed in an expression plasmid. Reference is 
made to Examples 1 to 6. As to the expressions "F v antibody 
construct" and "peptide linker" reference is made to the 
above explanations and, by way of supplement, to Maniatis, 
T. et al.. Molecular Cloning, A Laboratory Manual, Cold 
Spring Harbor Laboratory 1982. 

DNAs which code for an F v antibody construct according to 
the invention also represent a subject matter of the present 
invention. Furthermore, expression plasmids which contain 
such DNAs also represent a subject matter of the present 
invention. Preferred expression plasmids are pDISC3xl9-LL, 
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pDISC3xl9-SL, pPIODISC-LL, pPIC-DISC-SL, pDISC5-LL and 
pDISC6-SL. The first four were deposited with the DSMZ 
(Deutsche Sammlung fur Mikroorganismen und Zellen) [German- 
type collection for micro-organisms and cells] on April 30, 
1998 under DSM 12150, DSM 12149, DSM 12152 and DSM 12151, 
respectively. 

Another subject matter of the present invention relates to a 
kit, comprising: 

(a) an F v antibody construct according to the invention, 
and/or 

(b) an expression plasmid according to the invention, and 

(c) conventional auxiliary agents, such as buffers, 
solvents and controls. 

One or several representatives of the individual components 
may be present. 

The present invention provides a multivalent F v antibody 
construct where the variable domains are linked with one 
another via peptide linkers. Such an antibody construct 
distinguishes itself in that it contains no parts which can 
lead to undesired immune reactions. Furthermore, it has 
great stability. It also enables to bind several antigens 
simultaneously. Therefore, the F v antibody construct 
according to the invention is perfectly adapted to be used 
not only for diagnostic but also for therapeutic purposes. 
Such purposes can be seen as regards any disease, in 
particular a viral, bacterial or tumoral disease. 
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Brief description of the drawings : 

Fig. 1 shows the genetic organization of an F v antibody 
construct (A) according to the invention and schemes for 
forming a bivalent (B) or tetravalent F v antibody construct 
(C) . Ag: antigen; His6: six C-terminal histidine residues; 
stop: stop codon (TAA) ; V H and V L : variable region of the 
heavy and light chains. 

Fig. 2 shows the scheme for the construction of the plasmids 
pDISC3xl9-LL and pDISC3xl9-SL . c-myc: sequence coding for an 
epitope which is recognized by the antibody 9E1, His 6 : 
sequence which codes for six C-terminal histidine residues; 
PelB: signal peptide sequence of the bacterial pectate lyase 
(PelB leader); rbs: ribosome binding site; Stop: stop codon 
(TAA) ; V H and V L : variable region of the heavy and light 
chains . 

Fig. 3 shows a diagram of the expression plasmid pDISC3xl9- 
LL. 6xHis: sequence which codes for six C-terminal histidine 
residues; bla: gene which codes for ft-lactamase responsible 
for ampicillin resistance; bp: base pairs; c-myc: sequence 
coding for an epitope which is recognized by the 9E10 
antibody; ColEl: origin of the DNA replication; fl-IG: 
intergenic region of the bacteriophage fl; Lac P/O: wt lac- 
operon promoter/operator; linker 1: sequence which codes for 
a GlyGly dipeptide linking the V H and V L domains; linker 2: 
sequence coding for a (Gly 4 Ser) 4 polypeptide which links the 
hybrid scFv fragments; Pel-B leader: signal peptide sequence 
of the bacterial pectate lyase; rbs: ribosome binding site; 
V H and V L : variable region of the heavy and light chains. 

Fig. 4 shows a diagram of the expression plasmid pDISC3x!9- 
SL. 6xHis: sequence which codes for six C-terminal histidine 
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residues; bla : gene which codes for ^-lactamase which is 
responsible for the ampicillin resistance; bp: base pairs; 
c-myc: sequence coding for an epitope recognized by the 9E10 
antibody; ColEl: origin of DNA replication; fl-IG: 
intergenic region of the bacteriophage fl; Lac P/O: wt lac- 
operon promoter/operator: linker 1: sequence which codes for 
a GlyGly dipeptide which links the V H and V L domains; linker 
3: sequence which codes for a GlyGlyProGlySer oligopeptide 
which links the hybrid scFv fragments; Pel-B leader: signal 
peptide sequence of the bacterial pectate lyase; rbs: 
ribosome binding site; V H and V L : variable region of the 
heavy and light chains. 

Fig, 5 shows the nucleotide sequence and the amino acid 
sequence derived therefrom of the bivalent F v antibody 
construct encoded by the expression plasmid pDIS3xl9-LL . c- 
myc epitope: sequence coding for an epitope which is 
recognized by the antibody 9E10; CDR: region determining the 
complementarity; framework: framework region; His6 tail: 
sequence which codes for six C-terminal histidine residues; 
PelB leader: signal peptide sequence of the bacterial 
pectate lyase; RBS: ribosome binding site; V H and V L : 
variable region of the heavy and light chains. 

Fig, 6 shows the nucleotide sequence and the derived amino 
acid sequence of the tetravalent F v antibody construct 
encoded by the expression plasmid pDISC3xl9-SL. c-myc 
epitope: sequence coding for an epitope which is recognized 
by the 9E10 antibody; CDR: region determining 
complementarity; framework: framework region; His6 tail: 
sequence coding for the six C-terminal histidine residues; 
PelB leader: signal peptide sequence of the bacterial 
pectate lyase; RBS: ribosome binding site; V H and V L : 
variable region of the heavy and light chains. 
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Fig. 7 shows the nucleotide sequence and the derived amino 
acid sequence of a connection between a gene which codes for 
an a-factor leader sequence and a gene coding for the 
tetravalent F v antibody construct in the Pichia expression 
plasmid pPIC-DISOSL. Alpha-f actor signal: leader peptide 
sequence of the Saccharomyces cerevisiae-a factor secretion 
signal; V H : variable region of the heavy chain". Rhombs 
indicate the signal cleaving sites. 

Fig. 8 shows the nucleotide sequence and the derived amino 
acid sequence of a connection between a gene coding for an 
a-factor leader sequence and a gene which . codes for the 
bivalent F v antibody construct in the Pichia expression 
plasmid pPIC-DISC-LL. Alpha-factor signal: leader peptide 
sequence of the Saccharomyces cerevisiae-a factor secretion 
signal; V H : variable region of the heavy chain. Rhombs show 
the signal cleaving sites. 

Fig. 9 shows a diagram of the expression plasmid pDISC5-LL. 
6xHis: sequence coding for six C-terminal histidine 
residues; bla: gene which codes for ft-lactamase responsible 
for ampicillin resistance; bp: base pairs; c-myc: sequence 
coding for an epitope which is recognized by the 9E10 
antibody; hok-sok: plasmid-stabilizing DNA locus; LacI: gene 
which codes for the Lac repressor; Lac P/0: wt lac-operon- 
promoter/operator; LacZ 1 : gene which codes for the a-peptide 
of li-galactosidase; linker 1: sequence which codes for a 
GlyGly dipeptide connecting the V H and V L domains; linker 2: 
sequence which codes for a (Gly 4 Ser)4 polypeptide linking 
the hybrid scFv fragments; M13 IG: intergenic region of the 
M13 bacteriophage; pBR322ori: origin of DNA replication; 
Pel-B leader: signal peptide sequence of the bacterial 
pectate lyase; rbs: ribosome binding site which originates 
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from the E. coli lacZ gene (lacZ), from the bacteriophage T7 
gene 10 (T7gl0) or from the E. coli skp gene (skp) ; skp: 
gene which codes for the bacterial periplasmic factor 
Skp/OmpH; tHP: strong transcription terminator; tIPP: 
transcription terminator; V H and V L : variable region of the 
heavy and light chains. 

Fig. 10 shows a diagram of the expression plasmid pDISC6-SL. 
6xHis: sequence which codes for six C-terminal histidine 
residues; bla: gene which codes for 6-lactamase responsible 
for ampicillin resistance; bp: base pairs: c-myc: sequence 
coding for an epitope which is recognized by the 9E10 
antibody; hok-sok: plasmid-stabilized DNA locus; LacI: gene 
which codes for the Lac repressor; Lac P/0: wt lac-operon 
promoter/operator; LacZ 1 : gene which codes for the a-peptide 
of ft-galactosidase; linker 1: sequence which codes for a 
GlyGly dipeptide which links the V H and V L domains; linker 
3: sequence which codes for a GlyGlyProGlySer oligopeptide 
linking the hybrid scFv fragments: M13 IG: intergenic region 
of the M13 bacteriophage; pBR322ori: origin of DNA 
replication; Pel-B leader: signal peptide sequence of the 
bacterial pectate lyase; rbs : ribosome binding site 
originating from the E. coli lacZ gene (lacZ), from the 
bacteriophage T7 gene 10 (T7gl0) or from the E. coli skp 
gene (skp); skp: gene which codes for the bacterial 
periplasmic factor Skp/OmpH; tHP: strong transcription 
terminator; tIPP: transcription terminator; V H and V L : 
variable region of the heavy and light chains. 

The invention is explained by the below examples. 
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Example 1: Construction of the plasmids pDISC3xl9-LL and 

pDISC3xl9-SL for the expression of bivalent, 
bispecific and/or tetravalent, bispecific F v 
antibody constructs in bacteria 

The plasmids pH0G-aCDl9 and pH0G-dm0KT3 which code for the 
scFv fragments derived from the hybridoma HD37 which is 
specific to human CD19 (Kipriyanov et al., 1996, J. -Immunol. 
Meth. 196 , 51-62) and from the hybridoma 0KT3 which is 
specific to human CD3 (Kipriyanov et al., 1997, Protein 
Eng. 10, 445-453), respectively, were used for the 
construction of expression plasmids for a single-chain F v 
antibody construct. A PCR fragment 1 of the V H domain of 
anti-CD19, followed by a segment which codes for a GlyGly 
linker, was produced using the primers DPI, 5'- 
TC ACACA GAATTC -TTAGATCTATTAAAGAGGAGAAATTAACC , and DP2 , 5 1 - 
AGCACAC GATATC ACCGCCAAGCTTGGGTGTTGTTTTGGC (cf . Fig- 2) . The 
PCR fragment 1 was cleaved by EcoRI and EcoRV and ligated 
with the EcoRl/EcoRV-linearized plasmid pH0G-dm0KT3 so as to 
produce the vector pH0G19-3. The PCR fragment 2 of the V L 
domain of anti-CD19, followed by a segment which codes for a 
c-myc epitope and a hexahistidinyl tail, was produced using 
the primers DP3, 5 1 -AGCACAC AAGCTT GGCGGTGATATCTTGCTCACCCAAAC- 
TCCA, and DP4, 5 ' -AGCACACTCTAGAGACACAC AGATCT TTAGTGATGGTGAT- 
GGTGATGTGAGTTTAGG. The PCR fragment 2 was cleaved by Hindlll 
and Xbal and ligated with the HIndlll/Xbal-linearized 
plasmid pH0G-dm0KT3 so as to obtain the vector pHOG3-19 (cf. 
Fig. 2). The gene coding for the hybrid scFv-3-19 in the 
plasmid pHOG3-19 was amplified by means of PCR with the 
primers Bi3sk, 5 ' -CAGCCGG CCATGG CGCAGGTGCAACTGCAGCAG and 
either Li-1, 5 ■ -TATATACTGCAGCTGCACCTGGCTACCACCACCACCGGAGCCG- 
CCACCACCGCTACCACCGCCGCCAGAACCACCACCACCAGCGGCCGCAGCATCAGCCCG, 
for the production of a long flexible (Gly 4 Ser) 4 inter-scFV 
linker {PCR fragment 3, cf. Fig. 2) or Li-2, 5 T -TAT AT A- 
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CTGCAGCTGCACCTGCGACCCTGGGCCACCAGCGGCCGCAGC ATCAGCCCG , for the 
production of a short rigid GGPGS linker (PCR fragment 4, 
cf - Fig. 2) . The expression plasmids pDISC3xl9-LL and 
pDISC3xl9-SL were constructed by ligating the NcoI/PvuII 
restriction fragment from pHOG19-3, comprising the vector 
framework and the NcoI/PvuII-cleaved PCR fragments 3 and 4, 
respectively (cf. Figs. 3, 4). The complete nucleotide and 
protein sequences of the bivalent and tetravalent F v 
antibody constructs are indicated in Figs 5 and 6, 
respectively. 

Example 2: Construction of the plasmids pPIC-DISC-LL and 

pPIC-DISC-SL for the expression of bivalent, 
bispecific and/or tetravalent, bispecific P v 
antibody constructs in yeast 

(A) Construction of pPIC-DISC-SL 

The vector pPICZaA (Invitrogen BV, Leek, Netherlands) for 
the expression and secretion of recombinant proteins in the 
yeast Pichia pastoris was used as a starting material. It 
contains a gene which codes for the Saccharomyces cerevisiae 
a-f actor secretion signal, followed by a polylinker. The 
secretion of this vector is based on the dominant selectable 
marker, Zeocin™ which is bifunctional in both Pichia and E. 
coli. The gene which codes for the tetravalent F v antibody 
construct (scDia-SL) was amplified by means of PCR by the 
template pDISC3xl9-SL using the primers 5-PIC, 5 1 - 
CCGT GAATTC CAGGTGCAACTGCAGCAGTCTGGGGCTGAACTGGC, and pSEXBn 
5 1 -GGTCGACGTTAACCGACAAACAACAGATAAAACG . The resulting PCR 
product was cleaved by EcoRI and Xbal and ligated in 
EcoRI/Xbal-linearized pPICZaA. The expression plasmid pPIC- 
DISC-SL was obtained. The nucleotide and protein sequences 
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of the tetravalent F v antibody construct are shown in Fig. 
7. 

(B) Construction of pPIODISOLL 

The construction of pPIC-DISC-LL was carried out on the 
basis of pPICZotA (Invitrogen BV, Leek, Netherlands) and 
pDISC3xl9-LL (cf. Fig. 3). The plasmid-DNA pPICZoA was 
cleaved by EcoRI. The overhanging 5 1 -ends were filled using 
a Klenow fragment of the E. coli DNA polymerase I. The 
resulting DNA was cleaved by Xbal, and the large fragment 
comprising the pPIC vector was isolated. Analogous thereto 
the DNA of pDISC3xl9-LL was cleaved by Ncol and treated with 
a Klenow fragment. Following the cleavage using Xbal a small 
fragment, comprising a gene coding for the bivalent F v 
antibody, was isolated. Its ligation with a pPIOderived 
vector-DNA resulted in the plasmid pPIC-DISC-LL. The 
nucleotide and protein sequences of the bivalent F v antibody 
construct are shown in Fig. 8. 

Example 3: Expression of the tetravalent and/or bivalent 

F v antibody construct in bacteria 

E. coli XLl-blue cells (Strategene, La Jolla, CA) which had 
been transformed with the expression plasmids pDISC3xl9-LL 
and pDISC3xl9-SL, respectively, were cultured overnight in 
2xYT medium with 50 pg/ml ampicillin and 100 mM glucose 
(2xYT Ga ) at 37°C. 1:50 dilutions of the overnight cultures 
in 2xYT G a were, cultured as flask cultures at 37 °C while 
shaking with 200 rpm. When the cultures had reached an OD 6 oo 
value of 0.8, the bacteria were pelleted by 10-minute 
centrifugation with 1500 g at 20°C and resuspended in the 
same volume of a fresh 2xYT medium containing 50 pg/ml 
ampicillin and 0.4 M saccharose. IPTG was added up to a 
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final concentration of 0.1 mM, and the growth was continued 
at room temperature (20~22°C) for 18 - 20 h. The cells were 
harvested by 10-minute centrif ugation with 5000 g at 4°C. 
The culture supernatant was held back and stored on ice. In 
order to isolate the soluble periplasmic proteins, the 
pelleted bacteria were resuspended in 5 % of the initial 
volume of ice-cold 50 mM Tris-HCl, 20 % saccharose, 1 mM 
EDTA, pH 8.0. Following 1 hour of incubation on ..ice with 
occasional stirring the spheroplasts were centrifuged with 
30,000 g at 4°C for 30 minutes, the soluble periplasmic 
extract being obtained as supernatant and the spheroplasts 
with the insoluble periplasmic material being obtained as 
pellet. The culture supernatant and the soluble periplasmic 
extract were combined and clarified by further 
centrifugation (30,000 g, 4°C, 40 min.). The recombinant 
product was concentrated by ammonium sulfate precipitation 
(final concentration 70 % saturation). The protein 
precipitate was obtained by centrifugation (10,000 g, 4°C, 
40 min.) and dissolved in 10 % of the initial volume of 50 
mM Tris-HCl, 1 M NaCl, pH 7.0. An immobilized metal affinity 
chromatography (IMAC) was carried out at 4°C using a 5 ml 
column of chelating sepharose (Pharmacia) which was charged 
with Cu 2 * and had been equilibrated with 50 mM Tris-HCl, 1 M 
NaCl, pH 7.0 (starting buffer). The sample was loaded by 
passing it over the column. It was then washed with twenty 
column volumes of starting buffer, followed by starting 
buffer with 50 mM imidazole until the absorption at 280 nm 
of the effluent was at a minimum (about thirty column 
volumes) . The absorbed material was eluted with 50 mM Tris- 
HCl, 1 M NaCl, 250 mM imidazole, pH 7.0. 

The protein concentrations were determined with the Bradford 
dye binding test (1976, Anal. Biochem. 72, 248-254) using 
the Bio-Rad (Munich, Germany) protein assay kit. The 
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concentrations of the purified tetravalent and bivalent F v 
antibody constructs were determined from the A 2 so values 
using the extinction coefficients e lmg/mi = 1.96 and 1.93, 
respectively. 

Example 4: Expression of the tetravalent and/or bivalent 

antibody construct in the yeast Pichia 
pas tor is 

Competent P. pastoris GS155 cells (Invitrogen) were 
electroporated in the presence of 10 pg plasmid-DNA of pPTO 
DISC-LL and pPIC-DISC-SL, respectively, which had been 
linearized with Sad. The transf ormants were selected for 3 
days at 30°C on YPD plates containing 100 pg/ml Zeocin™. 
The clones which secreted the bivalent and/or tetravalent F v 
antibody constructs were selected by plate screening using 
an anti-c-myc-mAk 9E10 (IC Chemikalien, Ismaning, Germany) . 

For the expression of the bivalent F v antibody constructs 
and tetravalent F v antibody constructs, respectively, the 
clones were cultured in YPD medium in shaking flasks for 2 
days at 30°C with stirring. The cells were centrifuged 
resuspended in the same volume of the medium containing 
methanol and incubated for another 3 days at 30°C with 
stirring. The supernatants were obtained after the 
centrifugation. The recombinant product was isolated by 
ammonium sulfate precipitation, followed by IMAC as 
described above. 

Example 5: Characterization of the tetravalent F v 

antibody construct and bivalent F v antibody 
construct, respectively , 



(A) Size exclusion chromatography 
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An analytical gel filtration of the F v antibody constructs 
was carried out in PBS using a superdex 200-HR10/30 column 
(Pharmacia) . The sample volume and the flow rate were 200 
pl/min and 0.5 ml/min, respectively. The column was 
calibrated with high-molecular and low-molecular gel 
filtration calibration kits (Pharmacia) . 

(B) Flow cytometry 

The human CD3 + /CD19~-acute T-cell leukemia line Jurkat and 
the CD19 + /CD3" B-cell line JOK-1 were used for flow 
cytometrie. 5 x 10 5 cells in 50 pi RPMI 1640 medium (GIBCO 
BRL, Eggestein, Germany) which was supplemented with 10 % 
FCS and 0.1 % sodium azide (referred to as complete medium) 
were incubated with 100 pi of the F v antibody preparations 
for 4 5 minutes on ice. After washing using the complete 
medium the cells were incubated with 100 pi 10 pg/ml anti-c- 
myc-Mak 9E10 (IC Chemikalien) in the same buffer for 45 min 
on ice. After a second wash cycle, the cells were incubated 
with 100 pi of the FITC-labeled goat-anti-mouse-IgG (GIBCO 
BRL) under the same conditions as before. The cells were 
then washed again and resuspended in 100 pi 1 pg/ml 
propidium iodide solution (Sigma, Deisenhofen, Germany) in 
complete medium with the exclusion of dead cells. The 
relative fluorescence of the stained cells was measured 
using a FACScan flow cytometer (Becton Dickinson, Mountain 
View, CA) . 

(C) Cytotoxicity test 

The CD19-expressing Burkitt lymphoma cell line Raji and 
Namalwa were used as target cells. The cells were incubated 
in RPMI 1640 (GIBCO BRL) which was supplemented with 10 % 
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heat-inactivated FCS (GIBCO BRL), 2 mM glutamine and 1 mM 
pyruvate, at 37 °C in a dampened atmosphere with 7.5 % C0 2 . 
The cytotoxic T-cell tests were carried out in RPMI-1640 
medium supplemented with 10 % FCS, 10 mM HEPES, 2 mM 
glutamine, 1 mM pyruvate and 0.05 mM 2-ME. The cytotoxic 
activity was evaluated using a standard [ 51 Cr] release test; 
2 x 10 6 target cells were labeled with 200 pCi Na[ 51 Cr]0 4 
(Amersham-Buchler, Braunschweig, Germany) and washed 4 times 
and then resuspended in medium in a concentration of 2 x 
10 5 /ml. The effector cells were adjusted to a concentration 
of 5 x 10 6 /ml. Increasing amounts of CTLs in 100 pi were 
titrated to 10 4 target cells/well or cavity in 50 pi. 50 pi 
antibodies were added to each well. The entire test was 
prepared three times and incubated at 37°C for 4 h. 100 pi 
of the supernatant were collected and tested for [ 51 Cr] 
release in a gamma counter (Cobra Auto Gamma; Canberra 
Packard, Dreieich, Germany) . The maximum release was 
determined by incubation of the target cells in 10 % SDS, 
and the spontaneous release was determined by incubation of 
the cells in medium alone. The specific lysis (%) was 
calculated as: (experimental release - spontaneous 
release) / (maximum release - spontaneous release) x 100. 

Exainple 6: Construction of the plasmids pDISC5-LL and 

pDISC5-SL for the expression of bivalent, 
bispecific and/or tetravalent, bispecific F v 
antibody constructs in bacteria by high cell 
density fermentation 

Expression vectors were prepared which contained the hok/sok 
plasmid-free cell suicide system and a gene which codes for 
the Skp/OmpH periplasmic factor for a greater production of 
recombinant antibodies. The skp gene was amplified by PCR 
using the primers skp-1, 5'-CGA ATT CTT AAG ATA AGA AGG AGT 
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TTA TTG TGA AAA AGT GGT TAT TAG CTG CAG G and skp-2, 5 1 -CGA 
ATT AAG CTT CAT TAT TTA ACC TGT TTC AGT ACG TCG G using the 
plasmid pGAH317 (Hoick and Kleppe, 1988, Gene 67, 117-124). 
The resulting PCR fragment was cleaved by Aflll and Hindlll 
and inserted in the Af 111/HindIII-linearized plasmid pHKK 
(Horn et al., 1996, Appl . Microbiol. Biotechnol. 46, 524- 
532) so as to obtain the vector pSKK. The genes obtained in 
the plasmids pDISC3xl9-LL and pDISC3xl9-SL and coding for 
the scFv antibody constructs were amplified by means of the 
primers fe-1, 5' -CGA ATT TCT AGA TAA GAA GGA GAA ATT AAC CAT 
GAA ATA CC and fe-2, 5 '-CGA ATT CTT AAG CTA TTA GTG ATG GTG 
ATG GTG ATG TGA G. The Xbal/Af II I-cleaved PCR fragments were 
inserted in pSKK before the skp insert so as to obtain the 
expression plasmids pDISC5-LL and pDISC6-SL, respectively, 
which contain tri-cistronic operons under the control of the 
lac promoter/operator system (cf. figs. 9, 10). 
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SEQUENCE RECORD 



(1) GENERAL INDICATIONS: 

(i) APPLICANT: 

(A) NAME: Deutsches Krebsf orschungszentrum 

(B) STREET: Im Neuenheimer Feld 280 

(C) TOWN: Heidelberg 

(E) COUNTRY: Germany 

(F) POSTAL CODE: 69120 

(ii) TITLE OF THE INVENTION: Multivalent 'Antibody 
Constructs 

(iii) NUMBER OF SEQUENCES: 17 

(iv) COMPUTER-READABLE VERSION: 

(A) DATA CARRIER: floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, version 
#1.30 (EPA) 



(2) INDICATIONS AS TO SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1698 base pairs 

(B) KIND: nucleotide 

(C) STRAND TYPE: single strand 

(D) TOPOLOGY: linear 

(ii) KIND OF MOLECULE: genome DNA 

(iii) HYPOTHETICAL: no 

(iv) ANTISENSE: no 
(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) POSITION : 28. .1689 
(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) POSITION: 28.. 1689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



GAATTCATTA AAGAGGAGAA ATTAACC ATG AAA TAC CTA TTG CCT ACG GCA 51 

Met Lys Tyr Leu Leu Pro Thr Ala 
1 5 
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GCC GCT GGC TTG CTG CTG CTG GCA GCT CAG CCG GCC ATG GCG CAG GTG 
Ala Ala Gly Leu Leu Leu Leu Ala Ala Gin Pro Ala Met Ala Gin Val 
10 15 20 



99 



CAA CTG CAG CAG TCT GGG GCT GAA CTG GCA AGA CCT GGG GCC TCA GTG 
Gin Leu Gin Gin Ser Gly Ala Glu Leu Ala Arg Pro Gly Ala Ser Val 
25 30 35 40 



147 



AAG ATG TCC TGC AAG GCT TCT GGC TAC ACC TTT ACT AGG TAC ACG ATG 
Lys Met Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Arg Tyr Thr Met- 
45 50 55 



195 



CAC TGG GTA AAA CAG AGG CCT GGA CAG GGT CTG GAA TGG ATT GGA TAC 
His Trp val Lys Gin Arg Pro Gly Gin Gly Leu Glu Trp lie Gly Tyr 
60 65 70 



243 



ATT AAT CCT AGC CGT GGT TAT ACT AAT TAC AAT CAG AAG TTC AAG GAC 
lie Asn Pro Ser Arg Gly Tyr Thr Asn Tyr Asn Gin Lvs Phe Lys Asp 
75 80 85 



291 



AAG GCC ACA TTG ACT ACA GAC AAA TCC TCC AGC ACA GCC TAC ATG CAA 
Lys Ala Thr Leu Thr Thr Asp Lys Ser Ser Ser Thr Ala Tyr Met Gin 
90 95 100 



339 



CTG AGC AGC CTG ACA TCT GAG GAC TCT GCA GTC TAT TAC TGT GCA AGA 
Leu Ser Ser Leu Thr Ser Glu Asp Ser Ala Val Tyr Tyr Cys Ala Arg 
105 110 115 120 



387 



TAT TAT GAT GAT CAT TAC AGC CTT GAC TAC TGG GGC CAA GGC ACC ACT 
Tyr Tyr Asp Asp His Tyr Ser Leu Asp Tyr Trp Gly Gin Gly Thr Thr 
125 130 135 



435 



CTC ACA GTC TCC TCA GCC AAA ACA ACA CCC AAG CTT GGC GGT GAT ATC 
Leu Thr Val Ser Ser Ala Lys Thr Thr Pro Lys Leu Gly Gly Asp lie 
140 145 150 



483 



TTG CTC ACC CAA ACT CCA GCT TCT TTG GCT GTG TCT CTA GGG CAG AGG 
Leu Leu Thr Gin Thr Pro Ala Ser Leu Ala Val Ser Leu Gly Gin Arg 
155 160 165 



531 



GCC ACC ATC TCC TGC AAG GCC AGC CAA AGT GTT GAT TAT GAT GGT GAT 
Ala Thr lie Ser Cys Lys Ala Ser Gin Ser Val Asp Tyr Asp Gly Asp 
170 175 180 



579 



AGT TAT TTG AAC TGG TAC CAA CAG ATT CCA GGA CAG CCA CCC AAA CTC 
Ser Tyr Leu Asn Trp Tyr Gin Gin lie Pro Gly Gin Pro Pro Lys Leu 
185 " 190 195 200 



627 



CTC ATC TAT GAT GCA TCC AAT CTA GTT TCT GGG ATC CCA CCC AGG TTT 
Leu He Tyr Asp Ala Ser Asn Leu Val Ser Gly He Pro Pro Arg Phe 
205 210 215 



675 



AGT GGC AGT GGG TCT GGG ACA GAC TTC ACC CTC AAC ATC CAT CCT GTG 
Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Asn He His Pro Val 
220 225 230 



723 
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GAG AAG GTG GAT GCT GCA ACC TAT CAC TGT CAG CAA AGT ACT GAG GAT 771 

Glu Lys Val Asp Ala Ala Thr Tyr His Cys Gin Gin Ser Thr Glu Asp 
235 240 245 

CCG TGG ACG TTC GGT GGA GGC ACC AAG CTG GAA ATC AAA CGG GCT GAT 819 
Pro Trp Thr Phe Gly Gly Gly Thr Lys Leu Glu lie Lys Arg Ala Asp 
250 255 260 

GCT GCG GCC GCT GGT GGT GGT GGT TCT GGC GGC GGT GGT AGC GGT GGT 867 
Ala Ala Ala Ala Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly- 
265 270 275 280 

GGC GGC TCC GGT GGT GGT GGT AGC CAG GTG CAG CTG CAG CAG TCT GGG 915 
Gly Gly Ser Gly Gly Gly Gly Ser Gin Val Gin Leu Gin Gin Ser Gly 
285 290 295 

GCT GAG CTG GTG AGG CCT GGG TCC TCA GTG AAG ATT TCC TGC AAG GCT 963 
Ala Glu Leu Val Arg Pro Gly Ser Ser Val Lys He Ser Cys Lys Ala 
300 305 310 

TCT GGC TAT GCA TTC AGT AGC TAC TGG ATG AAC TGG GTG AAG CAG AGG 1011 
Ser Gly Tyr Ala Phe Ser Ser Tyr Trp Met Asn Trp Val Lys Gin Arg 
315 320 325 

CCT GGA CAG GGT CTT GAG TGG ATT GGA CAG ATT TGG CCT GGA GAT GGT 1059 
Pro Gly Gin Gly Leu Glu Tro He Gly Gin He Trp Pro Gly Asp Gly 
330 335 340 

GAT ACT AAC TAC AAT GGA AAG TTC AAG GGT AAA GCC ACT CTG ACT GCA 1107 
Asp Thr Asn Tyr Asn Gly Lys Phe Lys Gly Lys Ala Thr Leu Thr Ala 
345 350 355 360 

GAC GAA TCC TCC AGC ACA GCC TAC ATG CAA CTC AGC AGC CTA GCA TCT 1155 
Aso Glu Ser Ser Ser Thr Ala Tyr Mec Gin Leu Ser Ser Leu Ala Ser 
365 370 375 

GAG GAC TCT GCG GTC TAT TTC TGT GCA AGA CGG GAG ACT ACG ACG GTA 1203 
Glu Aso Ser Ala Val Tyr Phe Cys Ala Arg Arg Glu Thr Thr Thr Val 
380 385 390 

GGC CGT TAT TAC TAT GCT ATG GAC TAC TGG GGT CAA GGA ACC TCA GTC 1251 
Gly Arg Tyr Tyr Tyr Ala Mec Asp Tyr Trp Gly Gin Gly Thr Ser Val 
395 400 405 

ACC GTC TCC TCA GCC AAA ACA ACA CCC AAG CTT GGC GGT GAT ATC GTG 1299 
Thr Val Ser Ser Ala Lys Thr Thr Pro Lys Leu Gly Gly Asp He Val 
410 415 420 

CTC ACT CAG TCT CCA GCA ATC ATG TCT GCA TCT CCA GGG GAG AAG GTC 1347 
Leu Thr Gin Ser Pro Ala He Met Ser Ala Ser Pro Gly Glu Lys Val 
425 430 435 440 

ACC ATG ACC TGC AGT GCC AGC TCA AGT GTA AGT TAC ATG AAC TGG TAC 1395 
Thr Met Thr Cys Ser Ala Ser Ser Ser Val Ser Tyr Met Asn Trp Tyr 
44 5 45.0 4 55 
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CAG CAG AAG TCA GGC ACC TCC CCC AAA AGA TGG ATT TAT GAC ACA TCC 

Gin Gin Lys Ser Gly Thr Ser Pro Lys Arg Trp lie Tyr Asp Thr Ser 

460 46S 470 

AAA CTG GCT TCT GGA GTC CCT GCT CAC TTC AGG GGC AGT GGG TCT GGG 
Lvs Leu Ala Ser Gly Val Pro Ala His Phe Arg Gly Ser Gly Ser Gly 
475 480 455 

ACC TCT TAG TCT CTC ACA ATC AGC GGC ATG GAG GCT GAA GAT GCT GCC 
Thr Ser Tyr Ser Leu Thr He Ser Gly Met Glu Ala Glu Asp Ala Ala- 
490 495 500 

ACT TAT TAG TGC CAG CAG TGG AGT AGT AAC CCA TTC ACG TTC GGC TCG 
Thr Tyr Tyr Cys Gin Gin Trp Ser Ser Asn Pro Phe Thr Phe Gly Ser 
505 510 515 520 

GGG ACA AAG TTG GAA ATA AAC CGG GCT GAT ACT GCA CCA ACT GGA TCC 
Gly Thr Lys Leu Glu lie Asn Arg Ala Asp Thr Ala Pro Thr Gly Ser 
525 530 535 

GAA CAA AAG CTG ATC TCA GAA GAA GAC CTA AAC TCA CAT CAC CAT CAC 
Glu Gin Lys Leu He Ser Glu Glu Asp Leu Asn Ser His His His His 
540 545 550 

CAT CAC TAATCTAGA 
His His 



(2) INDICATIONS AS TO ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 554 amino acids 

(B) KIND: amino acid 
(D) TOPOLOGY: linear 

(ii) KIND OF MOLECULE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Lys Tyr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala 
15 10 15 

Ala Gin Pro Ala Met Ala Gin Val Gin Leu Gin Gin Ser Gly Ala Glu 
20 25 30 

Leu Ala Arg Pro Gly Ala Ser Val Lys Met Ser Cys Lys Ala Ser Gly 
35 40 45 

Tyr Thr Phe Thr Arg Tyr Thr Met Kis Trp Val Lys Gin Arg Pro Gly 
50 55 60 

Gin Gly Leu Glu Trp He Gly Tyr He Asn Pro Ser Arg Gly Tyr Thr 
65 70 75 80 
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Asn Tyr Asn Gin Lys Phe Lys Asp Lys Ala Thr Leu Thr Thr Asd Lys 

85 90 95 

Ser Ser Ser Thr Ala Tyr Met Gin Leu Ser Ser Leu Thr Ser Glu Asp 

100 105 110 

Ser Ala Val Tyr Tyr Cys Ala Arg Tvr Tyr Asp Asp His Tyr Ser Leu 

115 120 125 



Asd Tyr Trp Gly Gin Gly Thr Thr Leu Thr Val Ser Ser Ala Lys Thr 
130 135 140 

Thr Pro Lys Leu Gly Gly Asd lie Leu Leu Thr Gin Thr Pro Ala Ser 
145 150 155 160 

Leu Ala Val Ser Leu Gly Gin Arg Ala Thr lie Ser Cys Lys Ala Ser 
165 170 175 

Gin Ser Val Asp Tyr Asp Gly Asp Ser Tyr Leu Asn Trp Tyr Gin Gin 
180 185 190 

lie Pro Gly Gin Pro Pro Lys Leu Leu He Tyr Asp Ala Ser Asn Leu 
195 200 205 

Val Ser Gly He Pro Pro Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp 
210 215 220 

Phe Thr Leu Asn He His Pro Val Glu Lys Val Asp Ala Ala Thr Tyr 
225 230 235 240 

His Cys Gin Gin Ser Thr Glu Asp Pro Trp Thr Phe Gly Gly Gly Thr 
245 250 255 

Lys Leu Glu He Lys Arg Ala Asp Ala Ala Ala Ala Gly Gly Gly Gly 
260 265 270 

Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 
275 280 285 

Gin Val Gin Leu Gin Gin Ser Gly Ala Glu Leu Val Arg Pro Gly Ser 
290 295 300 

Ser Val Lys He Ser Cys Lys Ala Ser Gly Tyr Ala Phe Ser Ser Tyr 
305 310 315 320 

Trp Met Asn Trp Val Lys Gin Arg Pro Gly Gin Gly Leu Glu Trp He 
325 330 335 

Gly Gin He Trp Pro Gly Asp Gly Asp Thr Asn Tyr Asn Gly Lys Phe 
340 345 350 

Lys Gly Lys Ala Thr Leu Thr Ala Asp Glu Ser Ser Ser Thr Ala Tyr 
355 ' 360 365 
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Met Gin Leu Ser Ser Leu Ala Ser Glu Asd Ser Ala Val Tyr Phe Cys 
370 375 380 

Ala Arg Arg Glu Thr Thr Thr Val Gly Arg Tvr Tyr Tyr Ala Met Asd 
385 390 395 400 

Tyr Trp Gly Gin Gly Thr Ser Val Thr Val Ser Ser Ala Lys Thr Thr 
405 410 415 

Pro Lys Leu Gly Gly Asd lie Val Leu Thr Gin Ser Pro Ala lie Met 
420 425 430 

Ser Ala Ser Pro Gly Glu Lys Val Thr Met Thr Cys Ser Ala Ser Ser 
435 440 445 

Ser Val Ser Tvr Met Asn Trp Tyr Gin Gin Lvs Ser Gly Thr Ser Pro 
450 455 460 

Lys Arg Trp lie Tyr Asp Thr Ser Lys Leu Ala Ser Gly Val Pro Ala 
465 470 475 480 

His Phe Arg Gly Ser Gly Ser Gly Thr Ser Tyr Ser Leu Thr He Ser 
485 490 495 

Glv Met Glu Ala Glu Asp Ala Ala Thr Tvr Tyr Cys Gin Gin Tro Ser 
500 505 510 

Ser Asn Pro Phe Thr Phe Gly Ser Gly Thr Lys Leu Glu He Asn Arg 
515 520 525 

Ala Asp Thr Ala Pro Thr Gly Ser Glu Gin Lys Leu He Ser Glu Glu 
530 535 540 

Asp Leu Asn Ser His His His His His His 
545 550 



INDICATIONS AS TO ID NQ : 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1653 base pairs 

(B) KIND: nucleotide 

(C) STRAND TYPE: single strand 

(D) TOPOLOGY: linear 

(ii) KIND OF MOLECULE: genome DNA 

(iii) HYPOTHETICAL: no 

(iv) ANTI SENSE : no 
(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) POSITION: 28.. 1644 
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(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) POSITION: 28. .1644 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GAATTCATTA AAGAGGAGAA ATTAACC ATG AAA TAC CTA TTG CCT ACG GCA 

Mec Lys Tyr Leu Leu Pro Thr Ala 
I 5 

GCC GCT GGC TTG CTG CTG CTG GCA GCT CAG CCG GCC ATG GCG CAG GTG 
Ala Ala Gly Leu Leu Leu Leu Ala Ala Gin Pro Ala Met Ala Gin Val 
10 15 20 

CAA CTG CAG CAG TCT GGG GCT GAA CTG GCA AGA CCT GGG GCC TCA GTG 
Gin Leu Gin Gin Ser Gly Ala Glu Leu Ala Arg Pro Gly Ala Ser Val 
25 30 35 40 

AAG ATG TCC TGC AAG GCT TCT GGC TAC ACC TTT ACT AGG TAC ACG ATG 
Lys Met Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Arg Tyr Thr Met 
45 50 55 

CAC TGG GTA AAA CAG AGG CCT GGA CAG GOT CTG GAA TGC- ATT GGA TAC 
His Trp Val Lys Gin Arg Pro Gly Gin Glv Leu Glu Tro lie Glv Tvr 
60 65 70 

ATT AAT CCT AGC CGT GGT TAT ACT AAT TAC AAT CAG AAG TTC AAG GAC 
lie Asn Pro Ser Arg Gly Tvr Thr Asn Tvr Asn Gin Lys Phe Lvs Aso 
75 80 85 

AAG GCC ACA TTG ACT ACA GAC AAA TCC TCC AGC ACA GCC TAC ATG CAA 
Lys Ala Thr Leu Thr Thr Aso Lvs Ser Ser Ser Thr Ala Tvr Met Gin 
. 90 95 100 

CTG AGC AGC CTG ACA TCT GAG GAC TCT GCA GTC TAT TAC TGT GCA AGA 
Leu Ser Ser Leu Thr Ser Glu Asp Ser Ala Val Tvr Tyr Cvs Ala Arg 
105 110 115 120 

TAT TAT GAT GAT CAT TAC AGC CTT GAC TAC TGG GGC CAA GGC ACC ACT 
Tvr Tyr Asp Aso His Tyr Ser Leu Aso Tyr Trp Gly Gin Gly Thr Thr 
125 " 130 135 

CTC ACA GTC TCC TCA GCC AAA ACA ACA CCC AAG CTT GGC GGT GAT ATC 
Leu Thr Val Ser Ser Ala Lvs Thr Thr Pro Lys Leu Glv Glv Asd lie 
140 145 150 

TTG CTC ACC CAA ACT CCA GCT TCT TTG GCT GTG TCT CTA GGG CAG AGG 
Leu Leu Thr Gin Thr Pro Ala Ser Leu Ala Val Ser Leu Gly Gin Arg 
155 160 165 

GCC ACC ATC TCC TGC AAG GCC AGC CAA AGT GTT GAT TAT GAT GGT GAT 
Ala Thr lie Ser Cys Lvs Ala Ser Gin Ser Val Asp Tvr Aso Gly Asp 
170 175 180 
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AGT TAT TTG AAC TGG TAC CAA CAG ATT CCA GGA CAG CCA CCC AAA CTC 627 

Ser Tyr Leu Asn Trp Tyr Gin Gin lie Pro Gly Gin Pro Pro Lys Leu 
185 190 195 200 

CTC ATC TAT GAT GCA TCC AAT CTA GTT TCT GGG ATC CCA CCC AGG TTT 675 
Leu lie Tyr Asp Ala Ser Asn Leu Val Ser Gly lie Pro Pro Arg Phe 
205 210 215 

AGT GGC AGT GGG TCT GGG ACA GAC TTC ACC CTC AAC ATC CAT CCT GTG 723 
Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Asn lie His Pro VaL 
220 225 230 

GAG AAG GTG GAT GCT GCA ACC TAT CAC TGT CAG CAA AGT ACT GAG GAT 771 
Glu Lys Val Asp Ala Ala Thr Tyr His Cys Gin Gin Ser Thr Glu Asp 
235 240 245 

CCG TGG ACG TTC GGT GGA GGC ACC AAG CTG GAA ATC AAA CGG GCT GAT 819 
Pro Trp Thr Phe Gly Gly Gly Thr Lys Leu Glu lie Lys Arg Ala Asp 
250 255 260 

GCT GCG GCC GCT GGT GGC CCA GGG TCG CAG GTG CAC- CTG CAG CAG TCT 867 
Ala Ala Ala Ala Gly Gly Pro Gly Ser Gin Val Gin Leu Gin Gin Ser 
265 270 275 280 

GGG GCT GAG CTG GTG AGG CCT GGG TCC TCA GTG AAG ATT TCC TGC AAG 915 
Gly Ala Glu Leu Val Arg Pro Gly Ser Ser Val Lys lie Ser Cys Lys 
285 290 295 

GCT TCT GGC TAT GCA TTC AGT AGC TAC TGG ATG AAC TGG GTG AAG CAG * 963 

Ala Ser Gly Tyr Ala Phe Ser Ser Tyr Trp Met Asn Trp Val Lys Gin 
300 305 310 

AGG CCT GGA CAG GGT CTT GAG TGG ATT GGA CAG ATT TGG CCT GGA GAT 1011 
Arg Pro Gly Gin Gly Leu Glu Trp lie Gly Gin He Trp Pro Gly Asp 
315 320 325 

GGT GAT ACT AAC TAC AAT GGA AAG TTC AAG GGT AAA GCC ACT CTG ACT 1059 
Gly Asp Thr Asn Tyr Asn Gly Lys Phe Lys Gly Lys Ala Thr Leu Thr 
330 335 340 

GCA GAC GAA TCC TCC AGC ACA GCC TAC ATG CAA CTC AGC AGC CTA GCA 1107 
Ala Asp Glu Ser Ser Ser Thr Ala Tyr Met Gin Leu Ser Ser Leu Ala 
345 350 * 355 360 

TCT GAG GAC TCT GCG GTC TAT TTC TGT GCA AGA CGG GAG ACT ACG ACG 1155 
Ser Glu Asp Ser Ala Val Tyr Phe Cys Ala Arg Arg Glu Thr Thr Thr 
365 370 375 

GTA GGC CGT TAT TAC TAT GCT ATG GAC TAC TGG GGT CAA GGA ACC TCA 1203 
Val Gly Arg Tyr Tyr Tyr Ala Met Aso Tyr Trp Gly Gin Gly Thr Ser 
380 385 390 

GTC ACC GTC TCC TCA GCC AAA ACA ACA CCC AAG CTT GGC GGT GAT ATC 12 51 

Val Thr Val Ser Ser Ala Lys Thr Thr Pro Lys Leu Gly Gly Asp He 
395 400 405 
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GTG CTC ACT CAG TCT CCA GCA ATC ATG TCT GCA TCT CCA GGG GAG AAG 1299 
Val Leu Thr Gin Ser Pro Ala He Met Ser Ala Ser Pro Gly Glu Lys 
410 415 420 

GTC ACC ATG ACC TGC AGT GCC AGC TCA AGT GTA AGT TAC ATG AAC TGG 1347 
Val Thr Met Thr Cys Ser Ala Ser Ser Ser Val Ser Tyr Met Asn Trp 
425 430 435 440 

TAC CAG CAG AAG TCA GGC ACC TCC CCC AAA AGA TGG ATT TAT GAC ACA 1395 
Tyr Gin Gin Lys Ser Gly Thr Ser Pro Lys Arg Trp He Tyr Asp Thr 
445 450 455 

TCC AAA CTG GCT TCT GGA GTC CCT GCT CAC TTC AGG GGC AGT GGG TCT 1443 
Ser Lys Leu Ala Ser Gly Val Pro Ala His Phe Arg Gly Ser Gly Ser 
460 465 470 

GGG ACC TCT TAC TCT CTC ACA ATC AGC GGC ATG GAG GCT GAA GAT GCT 1491 
Gly Thr Ser Tyr Ser Leu Thr He Ser Gly Met Glu Ala Glu Asp Ala 
475 480 485 

GCC ACT TAT TAC TGC CAG CAG TGG AGT AGT AAC CCA TTC ACG TTC GGC 153 9 

Ala Thr Tvr Tyr Cys Gin Gin Trp Ser Ser Asn Pro Phe Thr Phe Gly 
490 * 495 500 

TCG GGG ACA AAG TTG GAA ATA AAC CGG GCT GAT ACT GCA CCA ACT GGA 1587 
Ser Gly Thr Lys Leu Glu He Asn Arg Ala Asp Thr Ala Pro Thr Gly 
505 510 515 520 

TCC GAA CAA AAG CTG ATC TCA GAA GAA GAC CTA AAC TCA CAT CAC CAT 163 5 

Ser Glu Gin Lvs Leu He Ser Glu Glu Asp Leu Asn Ser His His His 
525 530 535 

CAC CAT CAC TAATCTAGA 1653 
His His His 



(2) INDICATIONS AS TO ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 539 amino acids 

(B) KIND: amino acid 
(D) TOPOLOGY: linear 

(ii) KIND OF MOLECULE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



Met Lys Tyr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala 

1 5 10 15 

Ala Gin Pro Ala Met Ala Gin Val Gin Leu Gin Gin Ser Gly Ala Glu 

20 m 25 30 

Leu Ala Arg Pro Gly Ala Ser Val Lys Met Ser Cys Lys Ala Ser Gly 

35 40 45 
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Tyr Thr °he Thr Arg Tyr Thr Met His Trp Val Lys Gin Arg Pro Gly 
50 ^55 60 

Gin Gly Leu Glu Trp lie Gly Tyr lie Asn Pro Ser Arg Gly Tyr Thr 
65 70 75 80 

Asn Tyr Asn Gin Lys Phe Lys Asp Lys Ala Thr Leu Thr Thr Asp Lys 
85 90 95 

Ser Ser Ser Thr Ala Tyr Met Gin Leu Ser Ser Leu Thr Ser Glu Asp 
100 105 HO 

Se** Ala Val Tyr Tyr Cys Ala Arg Tyr Tyr Asp Asp His Tyr Ser Leu 
115 120 12S 

Asp Tyr Trp Gly Gin Gly Thr Thr Leu Thr Val Ser Ser Ala Lys Thr 
130 135 140 

Thr Pro Lys Leu Gly Gly Asp lie Leu Leu Thr Gin Thr Pro Ala Ser 
145 150 155 160 

Leu Ala Val Ser Leu Gly Gin Arg Ala Thr He Ser Cys Lys Ala Ser 
165 170 175 

Gin Ser Val Asp Tyr Asp Gly Asp Ser Tyr Leu Asn Trp Tyr Gin Gin 
180 185 190 

He Pro Gly Gin Pro Pro Lys Leu Leu He Tyr Asp Ala Ser Asn Leu 
195 . 200 205 

val Ser Gly He Pro Pro Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp 
210 215 220 

Phe Thr Leu Asn lie His Pro Val Glu Lys Val Asp Ala Ala Thr Tyr 
225 230 235 240 

His Cys Gin Gin Ser Thr Glu Asp Pro Trp Thr Phe Gly Gly Gly Thr 
245 250 255 

Lvs Leu Glu He Lvs Arg Ala Aso Ala Ala Ala Ala Gly Gly Pro Gly 
260 * ~ 265 270 

Ser Gin Val Gin Leu Gin Gin Ser Gly Ala Glu Leu Val Arg Pro Gly 
275 280 285 

Ser Ser Val Lys He Ser Cys Lys Ala Ser Gly Tyr Ala Phe Ser Ser 
290 295 300 

Tyr Trp Met Asn Trp Val Lys Gin Arg Pro Gly Gin Gly Leu Glu Trp 
305 310 315 320 

lie Gly Gin He Trp Pro Gly Asp Gly Asp Thr Asn Tyr Asn Gly Lys 
"325 " 330 335 
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?he Lys Gly Lys Ala Thr Leu Thr Ala Asp Glu Ser Ser Ser Thr Ala 
340 345 350 

Tyr Mec Gin Leu Ser Ser Leu Ala Ser Glu Aso Ser Ala Val Tyr Phe 
355 360 365 

Cys Ala Arg Arg Glu Thr Thr Thr Val Gly Arg Tyr Tyr Tyr Ala Met 
370 375 380 

Asp Tyr Trp Gly Gin Gly Thr Ser Val Thr Val Ser Ser Ala Lys Thr 
385 390 395 400 

Thr Pro Lys Leu Gly Gly Asp He Val Leu Thr Gin Ser Pro Ala He 
405 410 415 

Met Ser Ala Ser Pro Gly Glu Lvs Val Thr Met Thr Cys Ser Ala Ser 
420 425 430 

Ser Ser Val Ser Tyr Met Asn Trp Tyr Gin Gin Lys Ser Gly Thr Ser 
435 440 445 

Pro Lys Arg Trp He Tyr Asp Thr Ser Lys Leu Ala Ser Gly Val Pro 
450 455 460 

Ala His Phe Arg Glv Ser Gly Ser Gly Thr Ser Tyr Ser Leu Thr He 
465 470 475 480 

Ser Gly Met Glu Ala Glu Asp Ala Ala Thr Tyr Tyr Cys Gin Gin Trp 
485 490 495 

Ser Ser Asn Pro Phe Thr Phe Gly Ser Gly Thr Lys Leu Glu He Asn 
500 505 510 

Arg Ala Asp Thr Ala Pro Thr Gly Ser Glu Gin Lys Leu He Ser Glu 
515 520 525 



Glu Asp Leu Asn Ser His His His His His His 
530 535 



(2) INDICATIONS AS TO ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) KIND: nucleotide 

(C) STRAND TYPE: single strand 

(D) TOPOLOGY: linear 

(ii) KIND OF MOLECULE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 

(iii) HYPOTHETICAL: no 

(iv) ANTISENSE: no 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
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TATATACTGC AGCTGCACCT GCGACCCTGG GCCACCAGCG GCCGCAGCAT CAGCCCG 

(2) INDICATIONS AS TO ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) KIND: nucleotide 

(C) STRAND TYPE: single strand 

(D) TOPOLOGY: linear 



(ii) KIND OF MOLECULE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 

(iii) HYPOTHETICAL: no 

(iv) ANTISENSE: no 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CCGTGAATTC CAGGTGCAAC TGCAGCAGTC TGGGGCTGAA CTGGC 



(2) INDICATIONS AS TO ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 34 base pairs 

(B) KIND: nucleotide 

(C) STRAND TYPE: single strand 
<D) TOPOLOGY: linear 

(ii) KIND OF MOLECULE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 

(iii) HYPOTHETICAL: no 

(iv) ANTISENSE: no 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



GGTCGACGTT AACCGACAAA CAACAGATAA AACG 
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(2) INDICATIONS AS TO ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 348 base pairs 

(B) KIND: nucleotide 

(C) STRAND TYPE: single strand 

(D) TOPOLOGY: linear 

(ii) KIND OF MOLECULE: genome DNA 

(iii) HYPOTHETICAL: no 

(iv) ANTISENSE : no 
<ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) POSITION: 1..348 
(ix) FEATURE: 

(A) NAME /KEY; mat_peptide 

(B) POSITION: 1..348 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

ATG AGA TTT CCT TCA ATT TTT ACT GCT GTT TTA TTC GCA GCA TCC TCC 
Met Arg Phe Pro Ser lie Phe Thr Ala Val Leu Phe Ala Ala Ser Ser 
15 10 15 

GCA TTA GCT GCT CCA GTC AAC ACT ACA ACA GAA GAT GAA ACG GCA CAA 
Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gin 
20 25 30 

ATT CCG GCT GAA GCT GTC ATC GGT TAC TCA GAT TTA GAA GGG GAT TTC 
He Pro Ala Glu Ala Val lie Gly Tyr Ser Asp Leu Glu Gly Asp Phe 
35 40 45 

GAT GTT GCT GTT TTG CCA TTT TCC AAC AGC ACA AAT AAC GGG TTA TTG 
Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu 
50 55 60 

TTT ATA AAT ACT ACT ATT GCC AGC ATT GCT GCT AAA GAA GAA GGG GTA 
Phe He Asn Thr Thr He Ala Ser He Ala Ala Lys Glu Glu Gly Val 
65 70 75 80 

TCT CTC GAG AAA AGA GAG GCT GAA GCT GAA TTC CAG GTG CAA CTG CAG 
Ser Leu Glu Lys Arg Glu Ala Glu Ala Glu Phe Gin Val Gin Leu Gin 
85 90 95 

CAG TCT GGG GCT GAA CTG GCA AGA CCT GGG GCC TCA GTG AAG ATG TCC 
Gin Ser Gly Ala Glu Leu Ala Arg Pro Gly Ala Ser Val Lys Met Ser 
100 105 110 

TGC AAG GCT TCT 
Cys Lys Ala Ser 
115 
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2) INDICATIONS AS TO ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 amino acids 

(B) KIND; amino acid 
(D) TOPOLOGY: linear 

(ii) KIND OF MOLECULE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Arg Phe Pro Ser He Phe Thr Ala Val Leu Phe Ala Ala Ser Ser 
15 10 1£ 

Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gin 
20 25 30 

He Pro Ala Glu Ala Val lie Gly Tyr Ser Asp Leu Glu Gly Asp Phe 
35 40 45 

Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu 
50 55 60 

Phe He Asn Thr Thr lie Ala Ser He Ala Ala Lys Glu Glu Gly Val 
65 70 75 80 

Ser Leu Glu Lys Arg Glu Ala Glu Ala Glu Phe Gin Val Gin Leu Gin 
85 90 95 

Gin Ser Gly Ala Glu Leu Ala Arg Pro Gly Ala Ser Val Lys Met Ser 
100 105 110 

Cys Lys Ala Ser 
115 



(2) INDICATIONS AS TO ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 354 base pairs 

(B) KIND: nucleotide 

(C) STRAND TYPE: single strand 

(D) TOPOLOGY: linear 

(ii) KIND OF MOLECULE: genome DNA 



(iii) HYPOTHETICAL: no 

(iv) ANTISENSE: no 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) POSITION: 1..354 
(ix) FEATURE: 

(A) -NAME/KEY: mat_peptide 

(B) POSITION: 1 . . 354 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
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ATG AGA TTT CCT TCA ATT TTT ACT GCT GTT TTA TTC GCA GCA TCC TCC 
Met Arg Phe Pro Ser He Phe Thr Ala Val Leu Phe Ala Ala Ser Ser 
15 10 15 



48 



GCA TTA GCT GCT CCA GTC AAC ACT ACA ACA GAA GAT GAA ACG GCA CAA 
Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gin 
20 25 30 



96 



ATT CCG GCT GAA GCT GTC ATC GGT TAC TCA GAT TTA GAA GGG GAT TTC " 
He Pro Ala Glu Ala Val He Gly Tyr Ser Asp Leu Glu Gly Asp Phe 
35 40 45 



144 



GAT GTT GCT GTT TTG CCA TTT TCC AAC AGC ACA AAT AAC GGG TTA TTG 
Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu 
50 55 60 



192 



TTT ATA AAT ACT ACT ATT GCC AGC ATT GCT GCT AAA GAA GAA GGG GTA 
Phe He Asn Thr Thr He Ala Ser He Ala Ala Lys Glu Glu Gly Val 
65 70 75 80 



240 



TCT CTC GAG AAA AGA GAG GCT GAA GCT GAA TTC ATG GCG CAG GTG CAA 
Ser Leu Glu Lys Arg Glu Ala Glu Ala Glu Phe Met Ala Gin Val Gin 
85 90 95 



288 



CTG CAG CAG TCT GGG GCT GAA CTG GCA AGA CCT GGG GCC TCA GTG AAG 
Leu Gin Gin Ser Gly Ala Glu Leu Ala Arg Pro Gly Ala Ser Val Lys 
100 105 110 



336 



ATG TCC TGC AAG GCT TCT 
Met Ser Cys Lys Ala Ser 
115 



354 



2) INDICATIONS AS TO ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 118 amino acids 

(B) KIND: amino acid 
(D) TOPOLOGY: linear 

(ii) KIND OF MOLECULE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Arg Phe Pro Ser lie Phe Thr Ala Val Leu Phe Ala Ala Ser Ser 
15 10 15 

Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gin 
20 25 30 



He Pro Ala Glu Ala Val He Gly Tyr Ser Asp Leu Glu Gly Asp Phe 
35 40 45 
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Asd Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu 
50 55 60 

Phe lie Asn Thr Thr lie Ala Ser He Ala Ala Lys Glu Glu Gly Val 
65 70 75 80 

Ser Leu Glu Lys Arg Glu Ala Glu Ala Glu Phe Met Ala Gin Val Gin 
85 90 95 

Leu Gin Gin Ser Gly Ala Glu Leu Ala Arg Pro Gly Ala Ser Val Lys, 
100 105 110 

Met Ser Cys Lys Ala Ser 
115 



(2) INDICATIONS AS TO ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) KIND: nucleotide 

(C) STRAND TYPE: single strand 

(D) TOPOLOGY: linear 

(ii) KIND OF MOLECULE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 

(iii) HYPOTHETICAL: no 

(iv) ANTISENSE: no 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



TCACACAGAA TTCTTAGATC TATTAAAGAG GAGAAATTAA CC 



(2) INDICATIONS AS TO ID NO; 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) KIND: nucleotide 

(C) STRAND TYPE: single strand 

(D) TOPOLOGY: linear 

(ii) KIND OF MOLECULE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 

(iii) HYPOTHETICAL: no 

(iv) ANTISENSE: no 

(xi) SEQUENCE- DESCRIPTION: SEQ ID NO: 13: 
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AGCACACGAT ATCACCGCCA AGCTTGGGTG TTGTTTTGGC 



(2) INDICATIONS AS TO ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) KIND: nucleotide 

(C) STRAND TYPE: single strand 

(D) TOPOLOGY: linear 

(ii) KIND OF MOLECULE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 

(iii) HYPOTHETICAL: no 

(iv) ANTISENSE: no 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 



AGCACACAAG CTTGGCGGTG ATATCTTGCT CACCCAAACT CCA 

(2) INDICATIONS AS TO ID NO: 15: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) KIND: nucleotide 

(C) STRAND TYPE: single strand 

(D) TOPOLOGY: linear 

(ii) KIND OF MOLECULE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(iii) HYPOTHETICAL: no 

(iv) ANTISENSE: no 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
AGCACACTCT AGAGACACAC AGATCTTTAG TGATGGTGAT GGTGATGTGA GTTTAGG 



(2) INDICATIONS AS TO ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) KIND: nucleotide 

(C) STRAND TYPE: single strand 

(D) TOPOLOGY: linear 
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(ii) KIND OF MOLECULE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 

(iii) HYPOTHETICAL: no 

(iv) ANTISENSE: no 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



CAGCCGGCCA TGGCGCAGGT GCAACTGCAG CAG 33 

(2) INDICATIONS AS TO ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 base pairs 

(B) KIND: nucleotide 

(C) STRAND TYPE: single strand 

(D) TOPOLOGY: linear 

(ii) KIND OF MOLECULE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 

(iii) HYPOTHETICAL: no 

(iv) ANTISENSE: no 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



TATATACTGC AGCTGCACCT GGCTACCACC ACCACCGGAG CCGCCACCAC CGCTACCACC 
GCCGCCAGAA CCACCACCAC CAGCGGCCGC AGCATCAGCC CG 



60 
102 
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Amended Claims 

1. A multivalent F v antibody construct having at least 
four variable domains which are linked with one another via 
the peptide linkers 1, 2 and 3, wherein the peptide linkers 
1 and 3 have 0 to 10 amino acids. 

2. The F v antibody construct according to claim 1, wherein 
the peptide linkers 1 and 3 have the amino acid sequence GG. 

3. The F v antibody construct according to claim 1 or 2, 
wherein the F v antibody construct is bivalent. 

4. The F v antibody construct according to claim 3, wherein 
the peptide linker 2 has 11 to 20 amino acids. 

5. The F v antibody construct according to claim 3 or 4, 
wherein the peptide linker 2 has the amino acid sequence 
(G4S) 4 . 

6. The F v antibody construct according to claim 1 or 2, 
wherein the F v antibody construct is tetravalent. 

7. The F v antibody construct according to claim 6, wherein 
the peptide linker 2 has 3 to 10 amino acids. 
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8. The F v antibody construct according to claim 6 or 7, 
wherein the peptide linker 2 comprises the amino acid 
sequence GGPGS. 

9. The F v antibody construct according to any of claims 1 
to 8, wherein the F v antibody construct is multispecif ic . 

10. F v antibody construct according to claim 9, wh«rein the 
F v antibody construct is bispecific. 

11. The F v antibody construct according to any of claims 1 
to 8, wherein the F v antibody construct is monospecific. 

12. A method of producing the multivalent F v antibody 
construct according to any of claims 1 to 11 , wherein DNAs 
coding for the peptide linkers 1, 2 and 3 are ligated with 
DNAS coding for the four variable domains of an F v antibody 
construct such that the peptide linkers link the variable 
domains with one another and the resulting DNA molecule is 
expressed in an expression plasmid. 

13. Expression plasmid coding for the multivalent F v 
antibody construct according to any of claims 1 to 11. 

14. The expression plasmid according to claim 13, namely 
pDISC3xl9-LL. 

15. The expression plasmid according to claim 13, namely 
pDISC3xl9-SL. 

16. The expression plasmid according to claim 13, namely 
pPIC-DISOLL. 
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17. The expression plasmid according to claim 13, namely 
pPIC-DISC-SL. 

18. The expression plasmid according to claim 13, namely 
pDISC5-LL. 

19. The expression plasmid according to claim 13, namely 
pDISC6-SL. 

20. Use of the multivalent F v antibody construct according 
to any of claims 1 to 11 for the diagnosis and/or treatment 
of diseases. 

21. Use according to claim 20, wherein the diseases are 
viral, bacterial or tumoral diseases. 



CA 02331641 2000-11-03 



5/10 



EcoRI RES ?sl8 leaoer N COj 

i GAATio:rrAAAG^AGAJLV^ 

1* M X 7 ~. L ? 7 A A G 1 L L L A A Q ? A M 

f-ame-Hl vh anii-C03 

22 ►A Q V 0 t- Q 0 S G A £ L A & ? G A S V X M S C K A S G V T ? t 
C0R-H1 Frame-H2 CDR-H2 

133 TA GG T AC A C G AT GC AC TGGGTAA^A^G^GGCCTt^^^O IVr^AATC^A'TTrr^ T A C A 7T AATCC TACCCGTCCTT.X TAg 
32 > X Y 7 M K W V K Q X ? Z Q G L £ W I G Y I N ? S R G V ? 

357 T AA T T A C AA T C A G AA G T 7 C; AA G G A C>AGGCC.^C\TTCL : ^CTAC^CI^CAA^TC 3TC C AGC^C^GC CT AC-.TGC-ACTG.~GC--.GC C7GAC 
3C> N 7 M Q 1\ r X D X A 7 I* 7 7 2 X 5 S STAY>!Gv»55L7 

CDR-H3 r»£me-H4 
354 ATCT3AGC-.CTCTGCAGTCT TATTATG ATGAT C A TT AC AGC C TTG A C T A CT C^-^CA^GC^rClt^T-^ a 

1C9> S z D 5 A V Y Y C A * V D D 3 Y S 1 D Y W 3 0 G T 7 L 

CHI Unfrer J Frame-Li >/L anti-CD 19 

44 0 caGTra^A GC^AAA^^^ - ■ " V ■ & a ar-~ ar, *GSGCa« 

138 ► T V 5 5 A X 7 T ? ?. ;* G G D I L L 7 Q T ? A S L A 7 3 L G Q 

CDfi-LI =:ame-L2 
S30 GCC<rC^CA.TC!rCTGCAAGGCCAGCCAJ^GTGT?GA??>^ ^^ 

163> X A T I 5 C X A £ Q S V 3 Y 2 G ^ S Y L N W Y v C I ? G 

CDR-L2 rrame-L3 

514 AC^CACC^AAACTCCrTCAT^^ 

156 > Q ? ? X I* 1 I Y Z .-. S N L V SGI ???. 7SG53SG7D7 

C0R-L3 r.'ame-L A 
TO; C^CCCTClv.CATCCATCCTGT^ CAAAGTAC TG AGG A ^ GGTSGAC GT TC G GT G GA 

Z2l> T 1 M I H ? V I * '/ D A A T Y H C Q 3 S T E C ? W 7 ? G G 
Ckacca Noll Linker 2 

790 C<Kl\C:iAACCTtKLAAJ^^ 

253 > G 7 X L £ I X X A 3 A A A A G G G G S G G G G S G G G G 

Pvull Frame-H1 VH anti-C019 

874 rCCggrGG7GCr(7<yr*gCCAGGTGCACCTGC^^ /oAGgeriG-C-GT'C C7C^jgTCAAjC\TrrGrrCO^C-G 

283> S 0 G G G S Q V Q L Q Q S G A S L V X ? G S S V X I £ C X 

COfi-Hl Frame-H2 C3R-H2 

962 CT7C7CC<r7A7G^A77CAGT ACC^^^ 

312>A 3 G Y A F 5 S Y W M N W V X Q R ? 3 C* 3 L S W I G 0 - » 

Pstl Frame«H3 

104? C7GGAGATGGTGATAC7AACTAa\A?GCAAAGTTCAAGCGTA AAC^^ 
34i> ? G D G D T M Y G X ? "< G XA7LTADE3S57AY 

CDR-H3 

1133 T^\ACT^GCAC<XTAGCATC^ 
36?> M C L S 5 L A 5 Z r S A V Y ? C A R H E T T 7 V G ?» Y Y Y 
Frame-rU CHI UnAer / Frame-Li 

1215 GCTAT G G AC T A CTC<^7rCAAGCyA-^ GGCGCr3A7ATCGTGC7C^CTG 
3S6> A M D Y W G C G 7 5 V T V S S A K 7 7 ? K L G G D I V L T 
VL anti-C03 CDR-Li 
1307 AG7C7C CAGCAATCATG7C7 GC ATC TCCAGGG^aAGAAGGTCAC C^HGACCToC A C 7 G C C A GCTCAA GTG T AAG 77 A C A7G AACT tS 
42?>Q S ? A _ M S A 5 ? G H X V 7 M 7 C 5 A S S 5 V 5 Y M M W 
Frame-L2 C0R-L2 Frame-L.3 

13? 3 7ACC^JGCAGAAGTCAGC<1ACCTCCCCCA^ 
456> YQQXS GTS?KXWZYDT 5 X L A S G V ? A K r X G 

CDR-L3 

1481 GTGGv't n - :x^-Ar— r^^r-r^r—^r^ATr ^r^\Trru^-^A agatt.-^— ^r^a^Ar^vr agcag tgg agt agt aa 
485>S GSGT SYSL7ISGM2AE0AATYYCQ QWS S N 
F'ame-L4 CXappa c-rnyc epitope 

1569 CCCA77CACG TTCX5C^^::C<^.CAA^^ 
514> ? T 77GSG7XLSINXAD7APTGS E G X L Z S 

Hisd tail Xbal 
1653 AA GXAGA CC r>lAAC^A C^.TCACCA.7C^CATCXC^ AATCTA^ 
343>E E D L N S H H H H H H • 



FIGURE 5 
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Ecofll R8S Pe{8 leaaer Nco , 

1> « X Y L ? T A A A G L L L L A A Q .? A M 
♦ Frame-Hi VH anti-C03 

92 C3C'-.GGTGCVACTC^\GCLV3^^ 

22>A Q V 0 L Q Q S v A 2 A X ? G A S 
C0R-H1 Frame-H2 
132 TAGCTACACCATCCACTGGCrr;^-^^^^ 
32 > ?. V T M H W 7 XQRPGQGLSWIG Y I N ? S F. G Y T 

Frame -H3 

S&^SSAjLAASSA^tS^^ 

SO N Y N Q K r X 3 X A T L T T 3 X 5 5 5 T A Y M Q 1 5 £ L T 

C0R-H3 Frame-H* 

354 ATrrGAG^cTCTCOAGTCTAT^^^ 

10?> S £ 3 5 A V Y Y C A ?. '! Y D 2 H Y S L D Y M G Q G r 7 L 

CH1 L/flAfe/- ) Frame-Li VL anti-CDl9 

440 CnGTCTCC?CA G^AAA^ga&£a£I^^ CCTT CCC g C K-ATA T ! TTgCTCL-.CCrAAACTCCAG T T T * G GCTGTGTCTCTAGGGO.G.t 
123>T V 5 S A K T 7 = S L G G D Z L L T C T ? A 5 L A 7 S - S 0 

C0R-L1 rrame-L2 
=20 GGCCCACCVZCTCCTGC A«AG GC C A GC C AAA G T G T T G A TT A TG A TG G TG A T A G T T A T TT G AACT GGTAC TAAC-Il-.TTC 7AGGAC 
-=c> ?. A T I S C X A S C 5 7 D Y 2 G T 5 Y L X W Y 3 Z ' ? G 

C3R-L2 Frame-L3 

514 AGC CACCCAA.-.CTC - 1TCATCTATG ATGCATCC AA T C T AG ^T£J GT^ATC CZ^jCCCAGGTTTAGTGGCAGTGGSTC rGGGACAGACTT 
1?6> Q ? ? >: L L Z Y Z A 3 >J L V 5 5 I ? ? ?. r 3 G 5 G S G 7 D T 

CDR-L3 Frame- 1^ 

702 C^rCCrrCAACATCCATCCTG^rGGAGAAG^^ G T AC TG A G G A TC C3TGGA I L- ♦ " *L J L - 1 -U i A 

225 > T L N I K ? V2XVEAA7YHCQ Q S T 2 D ? W T : G G 

CVaoca Noil L/'nKer J Pvull Frame-Hi 

790 GGCAC ^\AGCTGGAAATCAAA CC-0CC^.T7<r^ ^ 

25 5> G T K L E I X ?. A D A A A A G G ? G S QVQLQQSGASL 

VH anti-CD19 CDR-H1 ."rame-H2 

37? GGTGAGGC:TZGG<riC:riCA^ 

2S4> V R ? G S S V K : S C K A 3 G Y A r S S Y W M N W V X Q X 

CDR-H2 

963 CTGGAC-JL^-^'l'l'^AGTCGATTC^ 

3i4> ? G Q G L S W I G Q I W ? G D G D T N Y N G K X G X A 

Frame-Ho 

1031 ACTCTCACTt^GACaVvTCCTC^^ 
342 ► T L T A D 2 S SSTAYJ1QLS S L A 5 £ D S A V Y ? C A X 
C0R-H3 Frame-H4 CH1 

1142 G G G AG AC T AC G AC G G TAG GC C G?T A TT A CT AT G C T A TC GACTAC TGGGTJntl\AGGAACCTC\GTCACC G 7C. Z r C AGCCAAAA, 
372 > X 2 T 7 T V G 3 Y Y Y A M 0 Y WGQGTSVTVSSAX 
Linker 1 Frame-Li VL ami-CD3 
122 6 CAA£2£^AAGCTT G GCGG^AT^.TCGTGCT^^CTOjGTCICC^^ 
J00>7 T ? X L G G D:VL?GS?AIMSAS?GSXV7M?C 

C0R-L1 Frame-L2 CDR-L2 

1316 G T G C C AG C TC AA G T G T AAGTT A C A TG AA Q r TC^XC^lXC^^(X^ ^(TTT^r^r \ p.~TY»~~~ ~ \ a AArL^TCr^ATT*7ATG A C A C A T C C AA 
430>S A3 S S V SYMNWYCQXSGTSPXXWIYDTS X 
Frame-L3 

1401 ACTGGCTTC T GGAGTCC rTGC!^ CTTVC^I^^^yfTTCC-CrC^V^C- A f ' " ' T ^f^^rv-^r ^<Tr-ACTf^ 

453> L A SGVPAHrXGSGSGTSYSLTlSGMcAIDA 
C0R-L3 Frame-L4 C kapoa 

1491 TGCCACTTATTACTGC C A GC AGTGGAGTAGTAAC7 CAT TC AC S T^rryy-^rryry Ar XAArrTOSAAATAAAC ^GT!tl-.TACrC-C 
488> A T Y Y C Q Q W S 5 M ? ? T G £ G V X L Z 1 M X A 0 T A 

c-myc epitope His 6 (ail Xbal 

1578 ACCAACTG GA3CC GAA CAAAA GCTGA TCTCA GAA CAA OA CC TAA.\CTCAC^ TC.^.CC ^ ^ r^_\TC.^.C7 AATCTAGA 
517 > ? T G S E Q X L I S £ I D L M 5 H H H H K H • 
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941 A T GAGATTTCCTIXT AATTTTTACIXXIU'I' I'i'I'ATTCGC AGCATC CTCCGC^.TTAGCTGCTCC^GTC^AC^CTAC 
!► M RFPS I FTAVLFAASSA LAAPVN TT 

alpha-factor signal 

1015 aac^gaagatgaaacggcacaaattcc 
25> tedeta qi paeavi gysdlegdfd 

1089 ttc<:tgttttgccattttc 
50> va vlpfsnstnngllf i nttiasia 

EcoRI 

Xhol ♦ ♦ 

1163 GCTAAAGAJ^.GAAGGGGTATCTCTCGAGAAAAGA T G C AAC T GC AG C AG T C 

75* AKEEGVSLEKREAEAEF Q V Q L QQ S 

VH anti-CD3 

1234 TGGGGCTGAACTGGCAAGACCTGGGGCCTCAGTGAAGATGTCCTGCAAGGCTTCT 
9S> G A ELA RPGASVKMSCK AS 
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941 AT GAGATTTCCTTCaJv^ ACACTAC 
1>M RFPSI FTAVLFAASSA LAAFV N Y T 

alpha-factor signal 

1015 AACAGAAGATGAAACGGCACAJ^TO 

25> TEDETAQI PAEAVI GYSDLEGDFD 

BsrDI 

1089 TTGCTGTTTTGCCATTTTC^ 

50> VAVLPFSNSTNNGILF I NTTlASIA 

EcoRI 

Xhol ♦ • 

1163 GCTAAAGAJ^GAAGGGGTATCTCTCGAGAArJ^Gn 

75 ►A KE EGV S L EK REA EA E FMA Q V Q L 0 

VH anti~CD3 

1235 CAGTCTGGGGCTGAACTGGCAAGACCTGGGGCCTCAGTGAAGATGTCCTGCAAGGCTTCT 
99 ►QS GA E L A RP GA SVKM S CKAS 
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